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Method for the development of gene panels for diagnostic and 
^erapeutic purposes based on the expression and methylation 
^us of the genes 

DESCRIPTION 
FIELD OF THE INVENTION 

The present invention concerns a method for the develop- 
, ment of gene panels for diagnostic and therapeutic purposes. 

The invention further concerns gene panels developed using the 




method of the present invention and their uses. 
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BACKGROUND OF THE INVENTION 

The levels of observation that have been well studied by 
the methodological developments of recent years in molecular 
biology include the gene itself, the translation of genes in 
RNA, and the resulting proteins. When, during the course of the 
development of an individual, a gene is switched on, and how 
the activation and inhibition of certain genes in certain cells 
and tissues is controlled, can be correlated with a high degree 
of probability with the extent and the character of the methy- 
lation of the gene or the genome. In this regard, it is reason- 
able to assume that pathogenic conditions are expressed in a 
modified methylation pattern of individual genes or of the ge- 
nome . 

STATE OF THE ART 
1. State of the art of molecular analysis of cell phenotypes 

The study of gene expression can be at the RNA level or at 



the protein level. Both levels in principle reflect important 
phenotypic parameters. Protein assays using two-dimensional 
gels (McFarrel method) have been known for approximately 15 
years. Using these assays, it is possible to elaborate the 
analysis of the chromatographic positions of several thousand 
proteins. Very early on, such electropherograms were already 
processed or evaluated with data processing means. In princi- 
ple, the validity of the method is high, however, it is infe- 
rior to the modern methods of gene expression based on RNA 
analysis in two regards. 

The global analysis of cellular proteins has recently been 
termed proteomics and is a key area of research that is devel- 
oping in the post-genome era. Proteomics uses a combination of 
sophisticated techniques including two-dimensional (2D) gel 
electrophoresis, image analysis, mass spectrometry, amino acid 
sequencing, and bio-informatics to resolve comprehensively, to 
quantify, and to characterize proteins (for reviews, see Cham- 
bers G et al. /^Proteomics: a new approach to the study of dis- 
ease^' J Pathol 2000 Nov; 192 (3) : 280-8; Banks RE et al. „Pro- 
teomics: new perspectives, new biomedical opportunities^^ Lancet 
2000 Nov 18;356(9243) :1749-56) . 

Proteomics is further said to contribute greatly to the 
understanding of gene function in the post-genomic era. Pro- 
teomics can be divided into three main areas: (1) protein mi- 
cro-characterization for large-scale identification of proteins 
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and their post-translational modifications; (2) 'differential 
display' proteomics for comparison of protein levels with po- 
tential application in a wide range of diseases; and (3) stud- 
ies of protein-protein interactions using techniques such as 
mass spectrometry or the yeast two-hybrid system. Because it is 
often difficult to predict the function of a protein based on 
homology to other proteins or even their three-dimensional 
C3 structure, determination of the components of a protein complex 

y^l or of a cellular structure is central in functional analysis 

i= (Pandey A et al. ^Proteomics to study genes and genomes.''' Na- 

S"^ ture 2000 Jun 15; 405 (6788) : 837-46) . 

f3SV 

C)ue to the complexity of higher eukaryotic cells, single- 

ul 

'^f step characterization of a proteome is likely to be difficult 

to achieve. Jung et al. (^Proteomics meets cell biology: the 
establishment of subcellular proteomes" Electrophoresis 2000 
Oct; 21 (16) : 3369-77) describe, that advantage can be taken of 
the macromolecular architecture of a cell, e.g., subcellular 
compartments, organelles, macromolecular structures and multi- 
protein complexes, to establish subcellular proteomes. 

Despite the recent developments in the art, the detection 
of proteins that are of regulatory importance, from small quan- 
tities of cells still fails because of the fact that the sensi- 
tivity of the methods used is much too low. Indeed, in contrast 
to nucleic acids, proteins cannot be amplified. In addition, 
the method is very complex, not amenable to automation, and 



very expensive. In contrast, RNA analysis presents considerable 
advantages, and due to of the use of PGR it is more sensitive. 
Above all, each RNA species recognized to be important can be 
identified immediately by its sequence. 

Overexpression or underexpression of individual RNAs with 
a known sequence can usually be easily detected; however, in 
connection with the applications discussed here, they are only 
valid in exceptional cases. 

The method of "differential displays" at best allows a 
semiquantitative study of expression. Expression products am- 
plified by PGR are separated by gel electrophoresis. The valid- 
ity is limited as a result of the resolution of the gel elec- 
trophoresis. In addition, the method is insufficiently sensi- 
tive and robust for use in routine diagnosis (Liang, P. and 
Pardee, A. B., Science 257, 967-971). 

Genes with high overexpression or underexpression are fre- 
quently identified by subtractive techniques. Here, cDNA clones 
of a cell or tissue species to be examined are plated. Against 
the clones, cDNA is hybridized as comparison material. Expres- 
sion patterns cannot be reliably prepared using this technique. 

Sturtevant et al. ( ^Applications of differential-display 
reverse transcription-PGR to molecular pathogenesis and medical 
mycology. Glin Microbiol Rev 2000 Jul; 13 (3) : 408-27) describe 
the characterisation of host-fungus interactions by changes in 
gene expression in both host and pathogen. Differential-display 



reverse transcription PGR (DDRT-PCR) is a PCR-based method that 
allows extensive analysis of gene expression among several cell 
populations. DDRT-PCR has been used to address biological ques- 
tions in mammalian systems, including cell differentiation, 
cell activation, cell stress, and identification of drug tar- 
gets. In microbial pathogenesis and plant pathogenesis, DDRT- 
PCR has allowed the identification of virulence factors, genes 
involved in cell death, and signaling genes. Further, To 
(„Identif ication of differential gene expression by high 
throughput analysis^^ Comb Chem High Throughput Screen 2000 
Jun; 3 (3) : 235-41 ) describes the high throughput analysis of dif- 
ferential gene expression as a powerful tool that can be ap- 
plied to many areas in molecular cell biology, including dif- 
ferentiation, development, physiology, and pharmacology. In re- 
cent years, a variety of techniques have been developed to ana- 
lyze differential gene expression, including comparative ex- 
pressed sequence tag sequencing, differential display, repre- 
sentational difference analysis, cDNA or oligonucleotide ar- 
rays, and serial analysis of gene expression. Similar strate- 
gies are described by Getting („Gene expression analysis. Pig- 
ment Cell Res 2000 Feb; 13 ( 1) : 21-7) and Watson ( „Dif f erential 
cDNA screening strategies to identify novel stage-specific pro- 
teins in the developing mammalian brain. Dev Neurosci 
1993;15(2) :77-86) . 

One activity of the American "Human Genome Project" is the 



systematic sequencing of expressed genes. The data obtained 
from this can be used to build expression chips, which allow 
the study of practically all expressed sequences of a cell or 
tissue type in a single experiment. 

2 . State of the art in the analysis of cancer diseases 

Mutations in genes can trigger cancer diseases, that is, 
cell transformation. The causes of these mutations can be ex- 
ogenous influences, or events in the cell. In a few exceptional 
cases, an individual mutation, which frequently affects larger 
regions of the genome (translocations, deletions) , results in 
the degeneration of the cell; but in most cases a chain of mu- 
tations on different genes is involved, and it is only their 
combined effect that results in the malignant disease. These 
changes at the DNA level are also reflected on the RNA and pro- 
tein levels. In this context, it is highly probable that a mul- 
tiplication occurs, because it is certain that in many cases 
the quantity and type of one RNA influences the extent of the 
synthesis of several other RNA species. This leads to a change 
in the synthesis rates of the corresponding proteins, which, in 
turn, can result in the deregulation of metabolism, and thus 
initiate the mechanism of regulation and counter regulation. 
The result is a gene expression pattern of the cells in ques- 
tion, that has been modified in a very specific (but largely 
nondeterminable) manner, the specificity is for a certain carci- 
noma, for the stage of the carcinoma, and the degree of malig- 
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nancy of the carcinoma. So far, such phenomena have been out- 
side the realm of study of natural sciences. Indeed, it has 
been impossible to examine the gene expression or the metabo- 
lism of a cell in its totality. Chip technology for the first 
time provided such a possibility (Schena, M. et al., Science 
270, 467-470). 

If one wishes to solve the diagnostic problem of early di- 

S : 
fast 

p agnosis of tumors on the molecular level, then one is con- 

Q 

□3 fronted, today, with an insurmountable difficulty. With very 

v.! 

few exceptions, for most tumors, the knowledge of the molecular 

SI 

01 events, that is, the different mutations, is only fragmentary, 

s 

p Researchers do not know what to look for in medical examination 

Ul 

Lij material. This means it is absolutely impossible to apply the 

£3 

□ remarkable sensitivity and specificity of the polymerase chain 

m 

reaction. Examples are certain intestinal tumors, Swing's sar- 
coma, and certain forms of leukemia, which are in fact each de- 
fined by a single, precisely described mutation. In those 
cases, it is possible to identify the degenerated cell among 
millions of normal cells. However, even within these apparently 
unambiguously defined tumor groups, there are such differences 
in the behavior that the conclusion must be drawn that addi- 
tional unknown genetic parameters (such as, for example, the 
genetic background of the individual) play an important role. 
Immunological tumor markers are helpful auxiliary parameters, 
but they continue to make only a modest contribution, in addi- 



tion to the other conventional diagnostic parameters. However, 
they can be used for the purpose of preselecting suspect cells. 

Histology plays an important and indispensable role in the 
identification of degenerated tissues, but not precisely in 
early diagnosis. 

Thus, because most tumors are not sufficiently character- 
ized for diagnostic purposes on the molecular level, as a rule, 
no possibilities exist to proceed to a subdivision into stages 
or even a subdivision by degrees of risk. Such a subdivision, 
however, is an absolute prerequisite for an improved selection 
of treatments and, above all, for the development of effective 
new drugs and of gene therapy. 

3. State of the art in research on the number, type and proper- 
ties of the possible stable states of cells of higher organisms 

In recent times, there has been an increase in the number 
of indications that complex regulatory systems (an excellent 
example of which is cell regulation) , when left alone, can ex- 
ist in only a limited number of stable states, above a critical 
minimum complexity and below a critical maximum connectivity 
(of the average number of the components, with which any given 
component is connected) (Kauffman, S. A., Origins of Order, Ox- 
ford University Press, 1993) . In this context, the word state 
should be understood as the concept of selection for the gen- 
eral phenomenon. In connection with cells as biological regula- 
tory systems, one can also talk of differentiation state or 
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cell type. Although no such connection has been demonstrated — 
and even a mere limitation of the possible states for biologi- 
cal systems has not been demonstrated — the practical implica- 
tions would be of very great importance: if, regarding the con- 
stant information content of the cells of an organism (de 
facto, such constancy essentially exists within one species), 
there were only a limited number of stable states, then it 

□ would be likely that degenerated cells could also be in only 

B 

gj one of these states or in a transition between the possible 

<l 

JZ states. At this time, it is not possible to define these states 

Si 

gi on a molecular basis. It is hardly possible to achieve a corre- 

s 

p lation between the individual states and the behavior of the 

m 

yj cells according to the state of the art. However, such an 

□ 

□ analysis could make decisive contributions to the diagnosis and 
prognosis of diseases. It is even possible that a correlation 
could be established between the possible states of diseased 
cells and the best suited therapy. Furthermore, it is probable 
that such a method could also have a decisive influence in the 
selection of the time of treatment. For example, if one were to 
discover that the cells of a tumor are in a transition between 
possible states, one could assume that such a population of 
cells would be more likely to yield to the selection pressure 
resulting from the treatment, and thus could escape more eas- 
ily. A cell population in such a scenario, within such transi- 
tional states, would have a considerably increased flexibility. 
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and it would be easily forced into a possible stable state, in 
which the selection pressure would be eliminated, and the 
treatment would thus be without effect. 

A method which could classify cells and cell groups ac- 
cording to states would then also contribute to recognizing, 
understanding and possibly solving such problems. However, ac- 
cording to the state of the art, it is not possible to deter- 

a . 

p mine whether only a limited number of states of cells exists. 

Q 

m It follows that it is not possible to differentiate groups of 

JS cells according to an abstract criterion concerning their 

m 

Si states, and to predict these states with a certain behavior of 

p the cells . 

Ul 

yj 4 . Hereditary diseases 

□ Today, the genetic map of the human genome comprises 2500 

so-called microsatellites . These instruments are used to locate 
a multitude of genes, usually genes whose defect causes a ge- 
netic disease, per linkage analysis, and then to identify them. 
Common genetic diseases caused by a single defective gene are 
thus elucidated, from the point of view of the geneticist's 
principle, polygenic diseases should also be understood in this 
manner. Many polygenic diseases are very common, so common that 
they are included among the so-called wide-spread diseases. 
Asthma and diabetes are examples. Many carcinoma types are also 
included. The use of the above described strategy of linkage 
analysis also produced enormous initial successes. In many in- 



stances, numerous causal genes of important polygenic diseases 
such as diabetes, schizophrenia, atherosclerosis and obesity 
have been found. Besides the availability of the molecular bi- 
ology laboratory techniques proper, the availability of a rela- 
tively large number of patients and relatives affected by each 
disease is a crucial prerequisite for genetic elucidation. In 
the past two years it has become apparent that the number of 
several hundred patients that were originally used for the 
linkage analysis of polygenic diseases very likely is too low 
by one order of magnitude. This applies, in any case, to cases 
where the entire spectrum of the causal gene is to be eluci- 
dated. Because the level of manual work required for such a 
linkage analysis is extraordinarily high, only very slow pro- 
gress can be expected in the analysis of polygenic diseases. 
Alternative strategies are sought because it is precisely these 
diseases that are of enormous social and economic importance. 
5. State of the art in methylation analysis 

The modification of the genomic base cytosine to 5'- 
methylcytosine represents the epigenetic parameter which to 
date is the most important one and has been best examined. Nev- 
ertheless, methods exist today to determine comprehensive geno- 
types of cells and individuals, but no comparable methods exist 
to date to generate and evaluate epigenotypic information on a 
large scale. 

In principle, there are three methods that differ in prin- 
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ciple for determining the 5-methyl state of a cytosine in the 
sequence context. 

The first method is based in principle on the use of re- 
striction endonucleases (RE), which are methylation-sensitive" . 
REs are characterized in that they produce a cut in the DNA at 
a certain DNA sequence which is usually 4-8 bases long. The po- 
sition of such cuts can be detected by gel electrophoresis, 
transfer to a membrane and hybridization. Methylation-sensitive 
means that certain bases within the recognition sequence must 
be unmethylated for the step to occur. The band pattern after a 
restriction cut and gel electrophoresis thus changes depending 
on the methylation pattern of the DNA. However, most CpG that 
can be methylated are outside of the recognition sequences of 
REs, and thus cannot be examined. 

The sensitivity of this method is extremely low (Bird, A. 
P., Southern, E. M., J, Mol. Biol. 118, 27-47). A variant com- 
bines PGR with this method; an amplification by two primers lo- 
cated on both sides of the recognition sequence occurs after a 
cut only if the recognition sequence is in the methylated form. 
In this case, the sensitivity theoretically increases to a sin- 
gle molecule of the target sequence; however, only individual 
positions can be examined, at great cost (Shemer, R. et al., 
PNAS 93, 6371-6376) . 

The second variant is based on the partial chemical cleav- 
age of whole DNA, using the model of a Maxam-Gilbert sequencing 
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reaction, ligation of adaptors to the ends thus generated, am- 
plification with generic primers, and separation by gel elec- 
trophoresis. Using this method, defined regions having a size 
of less than thousands of base pairs can be examined. However, 
the method is so complicated and unreliable that it is practi- 
cally no longer used (Ward, C, et al . , J. Biol. Chem. 265, 
3030-3033) . 

A new method for the examination of DNA to determine the 
presence of 5-methylcytosine is based on the specific reaction 
of bisulfite with cytosine. The latter is converted under ap- 
propriate conditions into uracil, which, as far as base pairing 
is concerned, is equivalent to thymidine, and which also corre- 
sponds to another base. 5-Methylcytosine is not modified. As a 
result, the original DNA is converted in such a manner that me- 
thylcytosine, which originally could not be distinguished from 
cytosine by its hybridization behavior, now can be detected by 
"normal" molecular biological techniques. All of these tech- 
niques are based on base pairing, which can now be completely 
exploited. The state of the art, as far as sensitivity is con- 
cerned, is defined by a method which includes the DNA to be ex- 
amined in an agarose matrix, intended to prevent the diffusion 
and renaturing of the DNA (bisulfite reacts only with single- 
stranded DNA) and to replace all precipitation and purification 
steps by rapid dialysis (Olek, A., et al., Nucl. Acids. Res. 
24, 5064-5066) . Using this method, individual cells can be ex- 
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amined, which illustrates the potential of the method. However, 
so far only individual regions up to approximately 3000 base 
pairs in length have been examined, and an overall examination 
of cells to identify thousands of possible methylation events 
is not possible. However, this method is not capable of relia- 
bly analyzing minute fragments from small sample quantities. In 
spite of protection against diffusion, such samples are lost 
through the matrix. 

6. State of the art with respect to Methylation and the diagno- 
sis of human diseases 

In the past, modification of the methylation pattern was 
analyzed in order to study and understand the genetic mecha- 
nisms which are involved in the outbreak or the progression of 
a disease. All this research was done in a piece-by-piece fash- 
ion by studying only one gene/chromosomal region at a time and 
no diagnosis/therapeutic treatment regimen was based on the me- 
thylation pattern modifications. In fact, the type of disease 
associated with the modification of the methylation pattern was 
known before methylation analysis was performed. Therefore, the 
following publications only indicate the widespread connection 
between modifications of the methylation patterns and human 
diseases. Modifications can include both hyper- or hypomethyla- 
tion of selected sites of the DNA. 

Disease associated with a modification of the methylation 
patterns are, for example: 
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- Leukemia (Aoki E et al. "Methylation status of the 
pl5INK4B gene in hematopoietic progenitors and peripheral blood 
cells in myelodysplastic syndromes" Leukemia 2000 
Apr; 14 (4) : 586-93; Nosaka K et al . "Increasing methylation of 
the CDKN2A gene is associated with the progression of adult T- 
cell leukemia" Cancer Res 2000 Feb 15; 60 (4) : 1043-8; Asimakopou- 
los FA et al. "ABLl methylation is a distinct molecular event 
associated with clonal evolution of chronic myeloid leukemia" 
Blood 1999 Oct 1; 94 (7 ): 2452-60; Fajkusova L. et al. "Detailed 
Mapping of Methylcytosine Positions at the CpG Island Surround- 
ing the Pa Promoter at the bcr-abl Locus in CML Patients and in 
Two Cell Lines, K562 and BV173" Blood Cells Mol Dis 2000 
Jun;26 (3) : 193-204; Litz CE et al. "Methylation status of the 
major breakpoint cluster region in Philadelphia chromosome 
negative leukemias" Leukemia 1992 Jan; 6 { 1) : 35-41 ) 

- Head and neck cancer (Sanchez-Cespedes M et al. "Gene 
promoter hypermethylation in tumors and serum of head and neck 
cancer patients" Cancer Res 2000 Feb 15; 60 (4) : 892-5) 

- Hodgkin's disease (Garcia JF et al . "Loss of pl6 protein 
expression associated with methylation of the pl6INK4A gene is 
a frequent finding in Hodgkin's disease" Lab Invest 1999 
Dec;79 (12) : 1453-9) 

- Gastric cancer (Yanagisawa Y et al. "Methylation of the 
hMLHl promoter in familial gastric cancer with microsatellite 
instability" Int J Cancer 2000 Jan 1;85 (1) : 50-3) 
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- Prostate cancer (Rennie PS et al. "Epigenetic mechanisms 
for progression of prostate cancer" Cancer Metastasis Rev 1998- 
99;17 (4) :401-9) 

- Renal cancer (Clifford SC et al. "Inactivation of the 
von Hippel-Lindau (VHL) tumor suppressor gene and allelic 
losses at chromosome arm 3p in primary renal cell carcinoma: 
evidence for a VHL-independent pathway in clear cell renal tu- 
mourigenesis" Genes Chromosomes Cancer 1998 Jul; 22 (3) : 200-9) 

- Bladder cancer (Sardi I et al . "Molecular genetic al- 
terations of c-myc oncogene in superficial and locally advanced 
bladder cancer" Eur Urol 1998 ; 33 (4 ): 424-30) 

- Breast cancer (Mancini DN et al . "CpG methylation within 
the 5* regulatory region of the BRCAl gene is tumor specific 
and includes a putative CREB binding site" Oncogene 1998 Mar 
5; 16 (9) : 1161-9; Zrihan-Licht S et al. "DNA methylation status 
of the MUCl gene coding for a breast-cancer-associated protein" 
Int J Cancer 1995 Jul 28; 62 (3) : 245-51; Kass DH et al . "Examina- 
tion of DNA methylation of chromosomal hot spots associated 
with breast cancer" Anticancer Res 1993 Sep-Oct; 13 (5A) : 1245-51) 

- Burkitt's lymphoma (Tao Q et al. "Epstein-Barr virus 
(EBV) in endemic Burkitt's lymphoma: molecular analysis of pri- 
mary tumor tissue" Blood 1998 Feb 15; 91 (4) : 1373-81) 

- Wilms tumor (Kleymenova EV et al . "Identification of a 
tumor-specific methylation site in the Wilms tumor suppressor 
gene" Oncogene 1998 Feb 12; 16 (6) : 713-20) 
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- Prader-Willi/Angelman syndrome (Zeschnigh et al. "Im- 
printed segments in the human genome: different DNA methylation 
patterns in the Prader-Willi/Angelman syndrome region as deter- 
mined by the genomic sequencing method" Human Mol . Genetics 
(1997) (6)3 pp 387-395; Fang P et al. "The spectrum of muta- 
tions in UBE3A causing Angelman syndrome" Hum Mol Genet 1999 
Jan;8(l) :129-35) 

- ICF syndrome (Tuck-Muller et al. "CMDNA hypomethylation 
and unusual chromosome instability in cell lines from ICF syn- 
drome patients" Cytogenet Cell Genet 2000; 89 (1-2) : 121-8) 

- Dermatofibroma (Chen TC et al . "Dermatofibroma is a 
clonal proliferative disease" J Cutan Pathol 2000 Jan; 27 (1) : 36- 
9) 

- Hypertension (Lee SD et al. "Monoclonal endothelial cell 
proliferation is present in primary but not secondary pulmonary 
hypertension" J Clin Invest 1998 Mar 1; 101 (5) : 927-34) 

- Pediatric Neurobiology (Campos-Castello J et al. "The 
phenomenon of genomic "imprinting" and its implications in 
clinical neuropediatrics" Rev Neurol 1999 Jan 1-15;28 (1) : 69-73) 

- Autism (Klauck SM et al. "Molecular genetic analysis of 
the FMR-1 gene in a large collection of autistic patients" Hum 
Genet 1997 Aug; 100 (2 ): 224-9) 

- Ulcerative colitis (Gloria L et al. "DNA hypomethylation 
and proliferative activity are increased in the rectal mucosa 
of patients with long-standing ulcerative colitis" Cancer 1996 



Dec 1;78 (11) :2300-6) 

- Fragile X syndrome (Hornstra IK et al. "High resolution 
methylation analysis of the FMRl gene trinucleotide repeat re- 
gion in fragile X syndrome" Hum Mol Genet 1993 Oct;2 (10) : 1659- 
65) 

- Huntington's disease (Ferluga J et al . "Possible organ 
and age-related epigenetic factors in Huntington's disease and 
colorectal carcinoma" Med Hypotheses 1989 May; 29 { 1) : 51-4 ) 

All the above-cited documents are hereby incorporated by 
reference . 

In view of the above, despite the advance in the art of 
the analysis of gene expression, a screening and/or the diagno- 
sis for a potential disease or medical condition is still a 
laborious and time consuming task, since in order to achieve a 
reliable result one has to analyse a vast number of differently 
expressed genes in parallel. 

This makes the analyses unreliable, time consuming, expen- 
sive, non-automateable and limits it to the analysis of single 
genes. No methods exist so far which address these problems to 
reasonably scale down the effort which has to be applied in or- 
der to achieve a result while maintaining its statistical qual- 
ity. 

Therefore, it is an object of the present invention to 
provide a method which allows for a streamlining of the effort 
for a reliable cell expression analysis. This analysis shall 



then be used for less time consuming and more effective diag- 
nostic and therapeutic purposes, in particular in personalised 
medical treatments . 

A further object of the invention is to provide the gene 
panel which is obtained using a method according to the inven- 
tion for the use in a treatment of a disease and/or medical 
condition . 

A further object of the invention is to provide systems, 
methods and computer program products for performing any of the 
inventive methods. 

A further object of the invention is to provide a treat- 
ment of a disease and/or medical condition, based on a gene 
panel according to the invention. 

This object is solved according to the present invention 
by providing a method for the development of gene panels for 
diagnostic and therapeutic purposes, which comprises the steps 
of: a) isolating at least one biological sample from each of at 
least two groups of biological material containing mRNA and/or 
proteins; b) analysing the expression level of at least one 
gene in the at least one biological samples; c) selecting the 
gene(s) exhibiting a different expression level between said at 
least two groups of biological material, whereby a first knowl- 
edge base is generated; d) analysing the level of cytosine me- 
thylation in the methylation relevant regions of at least one 
gene of at least one of the biological samples of step a). 



2a 



wherein the gene is selected on the basis of the first knowl- 
edge base; e) selecting the gene(s) exhibiting a different 
level of cytosine methylation between said at least two groups 
of biological material, whereby a second knowledge base is gen- 
erated; and f) adding selected genes from the second knowledge 
base to a gene panel. 

Thus, the present invention provides a method for generat- 
ing a gene panel combining only the advantages of the presently 
known expression analysis techniques, which results in a power- 
ful tool for the fast and reliable diagnosis and/or therapy of 
a enormous number of diseases and/or medical conditions. 

Cells et al. {„Gene expression profiling: monitoring tran- 
scription and translation products using DNA microarrays and 
proteomics^' FEES Lett 2000 Aug 25; 480 ( 1) : 2-16) describe a theo- 
retical approach to combining different technologies such as 
DNA microarrays and proteomics, which have made possible the 
analysis of the expression levels of multiple genes simultane- 
ously both in health and disease. In combination, these tech- 
nologies are said to revolutionise biology, in particular in 
the area of molecular medicine as they are expected to reveal 
gene regulation events involved in disease progression as well 
as to pinpoint potential targets for drug discovery and diag- 
nostics. Cells et al. review the current status of these tech- 
nologies and highlight some studies in which they have been ap- 
plied in concert to the analysis of biopsy specimens. Neverthe- 
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less. Cells et al. fail to describe or propose the combination 
of, in particular, data obtained in proteomics expression stud- 
ies and methylation analyses in order to provide gene panels 
for further therapeutic or diagnostic purposes. 

Other preferred embodiments of the invention will become 
apparent to the person skilled in the art after reading the 
features of the dependent claims. 

In one embodiment of the method according to the inven- 
tion, the biological sample is isolated by means of a biopsy, 
by means of an operation on an individual, by means of a dis- 
section, derived from a preserved biological sample, collected 
from body fluid (s) and/or collected directly from the environ- 
ment . 

In another embodiment of the method according to the in- 
vention, the biological sample comprises a eucaryotic and/or 
procaryotic cell line, a biopsy sample, blood, sputum, faeces, 
urine, cerebral liquid, tissue embedded in paraffin, tissue de- 
rived from eyes, intestine, brain, heart, prostata, kidney, 
lung, breast or liver, histological samples or a combination 
thereof. 

A preferred method according to the invention is charac- 
terised in that the at least one biological sample is derived 
from biological material of healthy and/or diseased individu- 
als. Such diseases include all diseases and/or medical condi- 
tions which involve a modification of the expression of genes 
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of the cell and include, for example, unwanted side effects of 
medicaments, cancers, metastasis, dysfunctions, damages or dis- 
eases of the central nervous system (CNS) , aggressive symptoms 
or behavioural disorders, clinical, psychological and social 
consequences of brain injuries, psychotic disorders and disor- 
ders of the personality, dementia and/or associates syndromes, 
cardiovascular diseases, malfunctions or damages, diseases, 
malfunctions or damages of the gastrointestine, diseases, mal- 
functions or damages of the respiratory system, injury, inflam- 
mation, infection, immunity and/or reconvalescence, diseases, 
malfunctions or damages as consequences of modifications in the 
developmental process, diseases, malfunctions or damages of the 
skin, muscles, connective tissue or bones, endocrine or meta- 
bolic diseases, malfunctions or damages, headache, and sexual 
malfunctions or combinations thereof, leukemia, head and neck 
cancer, Hodgkin's disease, gastric cancer, prostate cancer, re- 
nal cancer, bladder cancer, breast cancer, Burkitt's lymphoma, 
Wilms tumor, Prader-Willi/Angelman syndrome, ICF syndrome, der- 
matofibroma, hypertension, pediatric neurobiological diseases, 
autism, ulcerative colitis, fragile X syndrome, and Hunting- 
ton's disease. 

A preferred method according to the invention is charac- 
terised in that the isolation of said biological sample com- 
prises isolating subcellular compartments, organelles, macromo- 
lecular structures and multiprotein complexes, partial or com- 
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plete preparation of the itiRNA, reverse transcription or partial 
digestion of the material with an enzyme selected from prote- 
ases, RNAses and/or DNAses or combinations thereof. Such isola- 
tions and/preselections can initially even further limit the 
amount and complexity of the genes which take part in the in- 
ventive method. 

Further preferred is a method according to the invention 

• : 

in which the analysis of the expression level of the at least 
Ff, one gene in the biological sample comprises determining the 

X i 

j« relative amount of mRNA or protein derived from said at least 

SI 

m one gene. In general, all currently known methods for the 

n expression analysis of genes except methylation analysis can be 

Ul 

|jl used for this first step of the inventive method. Preferred are 

fi analyses that comprise one or more of one- or two-dimensional 

K 

gel electrophoresis, differential display, analysis of selected 
sets of tumour markers, subtractive hybridisation, mass spec- 
trometry, comparative expressed sequence tag sequencing, repre- 
sentational difference analysis, cDNA or oligonucleotide ar- 
rays, serial analysis of gene expression, enzymatic, fluores- 
cent, radioactive, dye and/or antibody labelling. Even more 
preferred are methods according to the invention which comprise 
measuring intensities of expression during one- or two- 
dimensional gel electrophoresis, differential display, subtrac- 
tive hybridisation, DNA, RNA or protein sequencing, mass spec- 
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trometry, and enzymatic, radioactive, dye and/or antibody la- 
belling. 

In another embodiment of the method according to the in- 
vention, the analysis is at least partially performed by means 
of a suited automate, for example a robot and/or a computer de- 
vice. Such device would be equipped with the necessary software 
for the analysis of the expression and could be connected to an 
inter- or intranet, be part of a neural network or the like. 
The necessary data/information for the analyses can be present 
on the system directly or at a remote source, to which the de- 
vice is directly or indirectly connected, for example via the 
internet . 

In a preferred method according to the invention, the ex- 
pression levels of at least two genes are analysed in parallel. 
More preferably, at least 100 genes are analysed in parallel. 

In another preferred embodiment of the method according to 
the invention the selection is based on a combination of the 
analysis of both mRNA level and protein expression. In another 
embodiment of the inventive method, the selection is based on 
the result of at least two individual rows of analyses. This 
provides for an internal control run of the data which is used 
for the selection and increases the preciseness of the results. 
This will further reduce the statistical error for the value of 
the expression of a selected gene with an only limited increase 
of the costs for the analysis. 
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In another preferred method according to the invention, 
the selection is performed in such a way as to give a first 
knowledge base comprising only one set of selected genes. Thus, 
the knowledge base will comprise only ,,on'' and „off type of 
data which allows for a very simple decision between expressed 
or non-expressed genes. In yet another embodiment of the inven- 
tive method, the selection is performed in such a way as to 
p give a first knowledge base comprising different subsets of se- 

03 lected genes. Such classes can be referred to as ^quality 

41 classes^' which allows for a much more differentiated selection 

m 

01 of the gene panel. The term ^quality classes'' as used herein 

s 

□ comprises all different possibilities of groupings the differ- 

m 

yj ent expression levels. Such grouping could, for example, in- 

□ elude different importance for the selected sites for the 

m 

analysis of the expression as * well as statistical preciseness 
and/or quality of the analysis data of the selected gene. 

In a preferred method according to the invention, the se- 
lection is at least partially performed automatically by means 
of a suited automate, like a computer device. Such device would 
be equipped with the necessary software for the analysis of the 
expression and could be connected to an inter- or intranet, be 
part of a neural network or the like. The necessary 
data/information for the analyses can be present on the system 
directly or at a remote source, to which the device is directly 
or indirectly connected, for example via the internet. 
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In a preferred method according to the invention, at least 
two genes are selected in parallel. More preferably, at least 
100 sites are selected in parallel. 

According to the invention it is further preferred to ana- 
lyse methylation relevant regions comprising the complete genes 
and/or promoters, introns, first exons and/or enhancers of the 
genes to be analysed. From the analysis of the methylation 
sites which are relevant for the expression of a certain gene, 
but not localised inside the sequence of the gene itself, the 
effect of the site for the expression of the gene can be read- 
ily extrapolated by the person skilled in the art. 

In another embodiment of the method according to the in- 
vention, the analysis of the level of cytosine methylation com- 
prises chemical treatment with bisulphite, hydrogen sulphite or 
disulphite, polymerase chain reaction (PGR), hybridisation 
analyses, sequencing, mass spectrometry and fluorescent, enzy- 
matic, radioactive, dye and/or antibody labelling. In general, 
all methods for the analysis of the methylation statuses at se- 
lected sites of the DNA can be employed. Such methods are known 
to the skilled artisan and are described in, for example, Dahl 
et al., ^Analysis of in vivo methylation.'' Methods Mol Biol 
2000;130:47-57; Zhou Y. et al., „Use of a single sequencing 
termination reaction to distinguish between cytosine and 5- 
methylcytosine in bisulf ite-modif ied DNA.'' Biotechniques 1997 
May;22 (5) : 850-4; Yoder JA et al. ^Genetic analysis of genomic 
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methylation patterns in plants and mammals." C Biol Chem 1996 
Oct; 377 (10) : 605-10 and others. 

Preferred is a method according to the invention in which 
the analysis is at least partially performed by means of a 
suited automate, for example a robot. Such device would be 
equipped with the necessary software for the analysis of the 
expression and could be connected to an inter- or intranet, be 
part of a neural network or the like. The necessary 
data/information for the analyses can be present on the system 
directly or at a remote source, to which the device is directly 
or indirectly connected, for example via the internet. 

Another preferred method according to the invention is 
characterised in that all potentially methylated sites of the 
DNA are analysed. Such sites usually include all so-called 
„CpG"-islands on a given DNA sequence and are readily detect- 
able by the person skilled in the art. Preferably, the level of 
cytosine methylation of at least two genes are analysed in par- 
allel. Preferably, the level of at least 100 cytosine methyla- 
tion sites is analysed in parallel. The analysis of a multitude 
of sites in parallel allows for both a effective screening and 
a statistically highly relevant result of the method. 

A further preferred method according to the invention is 
characterised in that the selection is based on the result of 
at least two individual rows of analyses. This will reduce the 
statistical error for the value of the methylation sensitivity 
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of a selected site with an only limited increase of the costs 
for the analysis. In another preferred method according to the 
invention, the selection is performed in such a way to give a 
second knowledge base comprising only one set of selected 
genes. Thus, the knowledge base will comprise only „on'' and 
,,off^^ type of data which allows for a very simple decision be- 
tween different methylation states. 

In yet another embodiment of the inventive method, the se- 
lection is performed in such a way to give a second knowledge 
base comprising different subsets of selected genes. Such 
classes can be referred to as ,,quality classes^' which allows 
for a much more differentiated analysis. 

In a preferred method according to the invention, the se- 
lection is at least partially performed automatically by means 
of a suited automate, like a computer device. Such device would 
be equipped with the necessary software for the analysis of the 
methylation sites and could be connected to an inter- or intra- 
net, be part of a neural network or the like. The necessary 
data/information for the analyses can be present on the system 
directly or at a remote source, to which the device is directly 
or indirectly connected, for example via the internet. 

In a preferred method according to the invention, at least 
two genes are selected in parallel. More preferably, at least 
100 genes are selected in parallel. 
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Another embodiment of the method according to the inven- 
tion is characterised in that all or a part of the genes of the 
second knowledge base are added to the gene panel. 

In another embodiment of the method according to the in- 
vention, additional information about methylation relevant re- 
gions of the selected genes is added to the gene panel. This 
additional information can comprise personal patient data, dis- 
ease specific data, prior treatment data and/or additional me- 
thylation specific data. 

Another embodiment of the method according to the inven- 
tion is characterised in that steps a) to f) are repeated. Re- 
peating the method of the invention suits several different 
purposes. First, as mentioned above, the statistical quality of 
the of the resulting data increases. Second, an internal con- 
trol can be provided, whether the biological sample was taken 
correctly and resembles e.g. the tissue of interest. The number 
of repeating ,,cycles'' of the invention can vary depending on 
the individual case, e.g. depending on the quality of the sam- 
ple to be analyse. One possibility would be to repeat the 
method of the invention for at least 5 to 50 times. Preferably, 
such method according to the invention is characterised in that 
the method is at least partially performed by means of a suited 
automate, for example a robot and/or a computer system. The in- 
ventive method can be conveniently automated and/or computer- 
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ized and respective devices and programs are readily known to 
the person skilled in the art. 

In another aspect according to the method according to the 
invention, the identical biological material, different bio- 
logical material or a combination thereof is used in step a) . 
Further, in another aspect according to the method according to 
the invention, the steps are performed in the following order: 
step a), step d) , step e) , step b) , step c) , and step f ) . This 
simply interchanges the order of the different steps of the in- 
ventive method which should nevertheless lead to a similar or 
identical result. According to the invention, this method can 
also at least partially be performed by means of a suited auto- 
mate, for example a robot. 

Another aspect of the present invention is related to a 
gene panel which is obtained according to a method according to 
the invention. This gene panel can be further used for the di- 
agnosis and/therapy of different diseases, like, for example, 
unwanted side effects of medicaments, cancers, metastasis, dys- 
functions, damages or diseases of the central nervous system 
(CNS), aggressive symptoms or behavioural disorders, clinical, 
psychological and social consequences of brain injuries, psy- 
chotic disorders and disorders of the personality, dementia 
and/or associates syndromes, cardiovascular diseases, malfunc- 
tions or damages, diseases, malfunctions or damages of the gas- 
trointestine, diseases, malfunctions or damages of the respira- 
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tory system, injury, inflammation, infection, immunity and/or 
reconvalescence, diseases, malfunctions or damages as conse- 
quences of modifications in the developmental process, dis- 
eases, malfunctions or damages of the skin, muscles, connective 
tissue or bones, endocrine or metabolic diseases, malfunctions 
or damages, headache, and sexual malfunctions or combinations 
thereof, leukemia, head and neck cancer, Hodgkin's disease, 
gastric cancer, prostate cancer, renal cancer, bladder cancer, 
breast cancer, Burkitt's lymphoma, Wilms tumor, Prader- 
Willi/Angelman syndrome, ICF syndrome, dermatofibroma, hyper- 
tension, pediatric neurobiological diseases, autism, ulcerative 
colitis, fragile X syndrome, and Huntington's disease. 

Further preferred is a gene panel according to the inven- 
tion which comprises additional information data about methyla- 
tion relevant regions of the selected genes, like the regions 
of the complete genes and/or promoters, introns, first exons 
and/or enhancers. According to the invention, the gene panel 
can be present in the form of a knowledge base on a computer 
disc, RAM, ROM, or a printed table or listing. 

According to the present invention, a „gene panel'" desig- 
nates a knowledge base, listing, table or other information 
source, that contains information about selected genes, herein 
also designated as ^candidate genes''. According to the present 
invention, the term „gene panel" should not be understood as 
merely containing information about the names or designations 
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of the candidate genes. The panel further can contain addi- 
tional information about the candidate genes, like sequence 
data, information about the origin (heredity) of the gene, spe- 
cies information, and information about the genetic elements 
and/or factors that influence expression of the candidate 
gene{s). Such elements can be the complete genes and/or promot- 
ers, introns, first exons and/or enhancers of the candidate 
genes. Factors can be growth conditions, developmental stage of 
the biological material from which the candidate gene is de- 
rived or other medical data. 

In another aspect of the invention, the gene panel can be 
used for the diagnosis of a disease. Such diseases can include 
unwanted side effects of medicaments, cancers, metastasis, dys- 
functions, damages or diseases of the central nerval system 
(CNS) , aggressive symptoms or behavioural disorders, clinical, 
psychological and social consequences of brain injuries, psy- 
chotic disorders and disorders of the personality, dementia 
and/or associates syndromes, cardiovascular diseases, malfunc- 
tions or damages, diseases, malfunctions or damages of the gas- 
trointestine, diseases, malfunctions or damages of the respira- 
tory system, injury, inflammation, infection, immunity and/or 
reconvalescence, diseases, malfunctions or damages as conse- 
quences of modifications in the developmental process, dis- 
eases, malfunctions or damages of the skin, muscles, connective 
tissue or bones, endocrine or metabolic diseases, malfunctions 
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or damages, headache, and sexual malfunctions or combinations 
thereof. Particularly preferred is a use according to the in- 
vention which is characterised in that the genes are related 
with leukemia, head and neck cancer, Hodgkin's disease, gastric 
cancer, prostate cancer, renal cancer, bladder cancer, breast 
cancer, Burkitt's lymphoma, Wilms tumor, Prader-Willi/Angelman 
syndrome, ICF syndrome, dermatofibroma, hypertension, pediatric 
neurobiological diseases, autism, ulcerative colitis, fragile X 
syndrome, and Huntington's disease. 

Another aspect of the present invention is related to a 
device for the generation of a gene panel according to the in- 
vention, which comprises means for generating a first and sec- 
ond knowledge base according to invention and means for adding 
selected genes from the second knowledge base to a gene panel. 

Another aspect of the present invention is related to a 
method for the diagnosis of a disease, comprising the following 
steps: a) providing a gene panel according to the invention; b) 
analysing the level of cytosine methylation at selected sites 
of the DNA based on said gene panel in biological material of 
at least one diseased individual with a known disease or medi- 
cal condition and/or at least one healthy individual, thereby 
generating a first knowledge base; c) analysing the level of 
cytosine methylation at selected sites of the DNA based on said 
gene panel in biological material of at least one diseased in- 
dividual with an unknown disease or medical condition, thereby 
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generating a second knowledge base; and d) providing a third 
knowledge base comprising a plurality of expert rules for com- 
paring the first and second knowledge base; e) selecting a type 
of disease or medical condition for the at least one diseased 
individual with an unknown disease or medical condition based 
on said first to third knowledge bases. 

A preferred method for the diagnosis of a disease is char- 
acterised in that the selected sites of the genes to be ana- 
lysed are located in the promoters, introns, first exons and/or 
enhancers or combinations thereof. The inventive method can 
further comprise the analysis of the level of cytosine methyla- 
tion comprises chemical treatment with bisulphite, hydrogen 
sulphite or disulphite, polymerase chain reaction (PGR), hy- 
bridisation analyses, sequencing, mass spectrometry and fluo- 
rescent, enzymatic, radioactive, dye and/or antibody labelling. 
Preferably, the method according to the invention is character- 
ised in that the analysis is at least partially performed by 
means of a suited automate, for example a robot. 

Further, in another embodiment of the inventive method, 
steps b) to d) are repeated before performing step e) . Prefera- 
bly, the identical biological material, different biological 
material or a combination thereof is used in step c) . 

In a preferred embodiment, the method according to the in- 
vention is at least partially performed by means of a suited 
automate, for example a robot. 
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In another aspect of the invention^ one embodiment of the 
inventive method is used for the diagnosis of unwanted side ef- 
fects of medicaments, cancers, dysfunctions, damages or dis- 
eases of the central nerval system (CNS) , aggressive symptoms 
or behavioural disorders, clinical, psychological and social 
consequences of brain injuries, psychotic disorders and disor- 
ders of the personality, dementia and/or associates syndromes, 
cardiovascular diseases, malfunctions or damages, diseases, 
malfunctions or damages of the gastrointestine, diseases, mal- 
functions or damages of the respiratory system, injury, inflam- 
mation, infection, immunity and/or reconvalescence, diseases, 
malfunctions or damages as consequences of modifications in the 
developmental process, diseases, malfunctions or damages of the 
skin, muscles, connective tissue or bones, endocrine or meta- 
bolic diseases, malfunctions or damages, headache, and sexual 
malfunctions or combinations thereof. Particularly preferred is 
a use for the diagnosis of leukemia, head and neck cancer, 
Hodgkin's disease, gastric cancer, prostate cancer, renal can- 
cer, bladder cancer, breast cancer, Burkitt's lymphoma, Wilms 
tumor, Prader-Willi/Angelman syndrome, ICF syndrome, dermatofi- 
broma, hypertension, pediatric neurobiological diseases, au- 
tism, ulcerative colitis, fragile X syndrome, and Huntington's 
disease. 

In an even further aspect, the invention provides a method 
for the treatment of a disease or medical condition which com- 
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prises a) providing at least one diagnosis according to a 
method as mentioned above; and b) installing a specific treat- 
ment for said at least one diagnosed disease or medical condi- 
tion. Preferably, said specific treatment is disease specific 
and/or personalised. In one embodiment this method is used for 
the treatment of unwanted side effects of medicaments, cancers, 
dysfunctions, damages or diseases of the central nerval system 
(CNS) , aggressive symptoms or behavioural disorders, clinical, 
psychological and social consequences of brain injuries, psy- 
chotic disorders and disorders of the personality, dementia 
and/or associates syndromes, cardiovascular diseases, malfunc- 
tions or damages, diseases, malfunctions or damages of the gas- 
trointestine, diseases, malfunctions or damages of the respira- 
tory system, injury, inflammation, infection, immunity and/or 
reconvalescence, diseases, malfunctions or damages as conse- 
quences of modifications in the developmental process, dis- 
eases, malfunctions or damages of the skin, muscles, connective 
tissue or bones, endocrine or metabolic diseases, malfunctions 
or damages, headache, and sexual malfunctions or combinations 
thereof. Most preferred is a use for the treatment of leukemia, 
head and neck cancer, Hodgkin's disease, gastric cancer, pros- 
tate cancer, renal cancer, bladder cancer, breast cancer, 
Burkitt's lymphoma, Wilms tumor, Prader-Willi/Angelman syn- 
drome, ICF syndrome, dermatofibroma, hypertension, pediatric 
neurobiological diseases, autism, ulcerative colitis, fragile X 
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syndrome, and Huntington's disease. 

The invention shall now be explained in more detail by the 
following examples with reference to the attached listing of 
genes and the accompanying drawings without limiting the scope 
of the concept of the invention. 

The attached sequence listing shows a panel of genes, 
whose methylation statuses are altered depending on cellular 
states as indicated above each subgroup of the listing. 

EXAMPLES 

Example 1 

Proteomics plus subsequent methylation screening 

In this example, a proteomics-derived step was used in or- 
der to analyse the expression level of a set of proteins. 
First, a 2-D Gelelectrophoresis according to standard protocols 
(see above) was perfomed for both a prostate cancer cell line 
and cells derived from a healthy prostate in which a staining 
with Sypro Ruby dye was used. Then, the resulting gels were 
scanned using a CCD-camera and the scanned picture were ana- 
lysed using a computer-based analysis software, e.g. ^^Imagemas- 
ter" (Amersham- Pharmacia) or ^'Z3" (Compugen) . 

Proteins that were differently expressed in both analysis 
pattern were excised by a robot (Flexys robot, genomic solu- 
tions) and tryptically digested. The peptides were analysed us- 
ing a MALDI-TOF mass spectrometry. The resulting fragments were 
analysed via peptide mapping and compared to the protein data- 
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base (which one) . Finally, the differently expressed proteins 
were listed and displayed. This listing included several dif- 
ferently expressed proteins. 

On the basis of the listing obtained in the first step, a 
methylation analysis was performed according to the methods de- 
scribed above. The analysis revealed, that the methylation 
statuses of the several proteins were significantly different 
compared to the non-prostate cancer cell line. 

Taken together, these selected genes can be further used 
in diagnosis and/or therapy as a minimal set of markers for 
prostate cancer while providing a result of maximal reliably. 



